1.  Introduction

An experimenter is asked which of k populations has the largest mean
and must decide how large a sample to take in order to answer this question.
Taking a large sample decreases the probability of an incorrect decision,
but at the same time increases the cost of sampling. It seems reasonable
that the "optimum" sample size should depend both on the cost of sampling
and on the amount of use to be made of the decision. In this paper, loss
functions are set up which take into consideration the amount of use to
be made of the result, the cost of making a wrong decision and the cost
of sampling. We assume that the parameters are random variables and that
only partial prior information is available. The $\Gamma$-minimax criterion
allows one to determine a sample size that minimizes the maximum expected
risk over $\Gamma$, a class of prior distributions. Note that if $\Gamma$
consists of a single prior, then the $\Gamma$-minimax criterion is the Bayes
criterion for that prior. At the other extreme, if $\Gamma$ consists of all
priors, then the $\Gamma$-minimax criterion is the usual minimax criterion.
Some statements on the development of the $\Gamma$-minimax criterion have
been discussed by Gupta and Huang [3]. Dunnett [1] discussed an optimal
sampling problem for the normal means problem under the usual minimax and
Bayes criteria. Ofosu [4] considered the minimax criterion for gamma
populations. In Section 2, we discuss the location parameter problem. The
scale parameter problem is considered in Section 3.


2.  Location Parameters

Let there be $k$ ($\ge 2$) independent populations with continuous
distribution functions $F(x-\theta_1), F(x-\theta_2), \ldots, F(x-\theta_k)$,
where the location parameters $\theta_i$ are unknown. Let
$X_{i1}, \ldots, X_{in}$ denote $n$ independent observations from the $i$th
population and define $\bar X_i = \frac{1}{n}\sum_{j=1}^{n} X_{ij}$
$(1 \le i \le k)$. It is well known that the distribution of
$X_{ij} - \theta_i$ does not depend on $\theta_i$ $(1 \le i \le k)$.
Define

$$G_n(y \mid F) = P\Big\{\tfrac{1}{n}\big[(X_{11}-\theta_1) + \cdots + (X_{1n}-\theta_1)\big] \le y\Big\}.$$

Then for any $i$, $1 \le i \le k$,

$$P\{\bar X_i \le x\} = G_n(x - \theta_i \mid F).$$

We wish to select the population associated with the largest $\theta_i$
using the usual selection procedure, and to determine the optimal sample
size by the $\Gamma$-minimax criterion. Let $\delta_i$ denote the
probability of selecting the $i$th population. Define the usual procedure
as follows:

$$\delta_i(x) = \begin{cases} 1 & \text{if } \bar x_i = \max_{1 \le j \le k} \bar x_j, \\ 0 & \text{otherwise.} \end{cases}$$


Then the probability that the $i$th population is selected is given by

$$p_i = P\Big\{\max_{1 \le j \le k,\; j \ne i} \bar X_j < \bar X_i\Big\}
      = \int_{-\infty}^{\infty} \prod_{j=1,\; j \ne i}^{k} G_n(x + \theta_i - \theta_j \mid F)\, dG_n(x \mid F).$$
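As an illustration of the selection probability $p_i$, the following sketch evaluates the integral numerically and compares it with a simulation of the selection rule. It assumes normal populations with unit variance, so that $G_n(x) = \Phi(x\sqrt{n})$; that choice, the function names, and the parameter values are assumptions made for this example and do not come from the text.

```python
import math
import random

def std_normal_pdf(u):
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)

def std_normal_cdf(u):
    return 0.5 * (1.0 + math.erf(u / math.sqrt(2.0)))

def selection_prob_integral(thetas, i, n, steps=800, lim=8.0):
    """p_i = int prod_{j != i} G_n(x + theta_i - theta_j) dG_n(x).

    Assumes N(theta_j, 1) populations, so G_n(x) = Phi(x * sqrt(n)).
    With u = x * sqrt(n) the integral is taken against the standard
    normal density and evaluated by Simpson's rule on [-lim, lim].
    """
    root_n = math.sqrt(n)
    h = 2.0 * lim / steps
    total = 0.0
    for m in range(steps + 1):
        u = -lim + m * h
        weight = 1 if m in (0, steps) else (4 if m % 2 else 2)
        prod = 1.0
        for j, theta_j in enumerate(thetas):
            if j != i:
                prod *= std_normal_cdf(u + root_n * (thetas[i] - theta_j))
        total += weight * prod * std_normal_pdf(u)
    return total * h / 3.0

def selection_prob_simulated(thetas, i, n, trials=20000, seed=0):
    """Fraction of simulated experiments in which population i has the
    largest sample mean (the usual selection procedure)."""
    rng = random.Random(seed)
    sd = 1.0 / math.sqrt(n)  # standard deviation of a sample mean
    wins = 0
    for _ in range(trials):
        means = [rng.gauss(t, sd) for t in thetas]
        if max(range(len(means)), key=means.__getitem__) == i:
            wins += 1
    return wins / trials

thetas = [0.0, 0.0, 0.5]  # hypothetical configuration, best population last
p_exact = selection_prob_integral(thetas, i=2, n=10)
p_sim = selection_prob_simulated(thetas, i=2, n=10)
```

The two values should agree up to Monte Carlo error, and the integrals for $i = 1, \ldots, k$ should sum to one because ties have probability zero for continuous distributions.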


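Let

$$\Omega = \{\theta \mid \theta = (\theta_1, \theta_2, \ldots, \theta_k)\}
  \quad\text{and}\quad
  \Omega_i = \Big\{\theta \;\Big|\; \theta_i \ge \max_{1 \le j \le k,\; j \ne i} \theta_j + \Delta\Big\},
  \quad i = 1, 2, \ldots, k,$$

where $\Delta$ is a given positive constant. Then
$\Omega = \Omega_0 \cup \Omega_1 \cup \cdots \cup \Omega_k$, where $\Omega_0$
is that part of $\Omega$ usually called the indifference zone.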

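For $\theta \in \Omega_i$, $1 \le i \le k$, define
$L^{(r)}(\theta, \delta_j) = 0$ for all $r \ne i$, and

$$L^{(i)}(\theta, \delta_j) = c'(\theta_i - \theta_j)\,\delta_j, \quad j = 1, 2, \ldots, k,$$

where $L^{(i)}(\theta, \delta_j)$ represents the loss for
$\theta \in \Omega_i$ when the $j$th population is selected, $c'$ being a
positive constant. For $\theta \in \Omega_0$, the loss is zero.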

The probability of making a wrong decision can be decreased by
increasing the size of the experiment on which the decision is to be
based, but this increases the cost of experimentation, which must also be
considered. It will be assumed here that the cost of performing an
experiment involving $n$ observations from each population is $cn$, where
$c$ is a positive constant. Let $\rho$ be a distribution over $\Omega$.
Then the risk function, or the expected loss, with experimentation costs
included is

$$\gamma_n(\rho) = cn + c' \sum_{i=1}^{k} \sum_{j=1}^{k} \int_{\Omega_i} \int_{E^k}
  (\theta_i - \theta_j)\, \delta_j(x)\, dF_{\theta}(x)\, d\rho(\theta).$$

Assume that partial information is available in the selection problem, so
that we are able to specify $\pi_i = P\{\theta \in \Omega_i\}$, with
$\sum_{i=0}^{k} \pi_i = 1$. Define

$$\Gamma = \Big\{\rho(\theta) \;\Big|\; \int_{\Omega_i} d\rho(\theta) = \pi_i, \; i = 1, 2, \ldots, k\Big\}.$$


We know that

$$\gamma_n(\rho) = cn + c' \sum_{i=1}^{k} \sum_{j=1}^{k} \int_{\Omega_i} (\theta_i - \theta_j)\, p_j\, d\rho(\theta).$$

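For any $i$, $1 \le i \le k$, let $\theta_i^*$ be some point in $\Omega_i$
such that

$$\sup_{\theta \in \Omega_i} \sum_{j=1}^{k} (\theta_i - \theta_j)\, p_j
  = \sum_{j=1}^{k} (\theta^*_{ii} - \theta^*_{ij})\, p^*_{ij},$$

where $\theta_i^* = (\theta^*_{i1}, \theta^*_{i2}, \ldots, \theta^*_{ik})$ and

$$p^*_{ij} = \int_{-\infty}^{\infty} \prod_{r=1,\; r \ne j}^{k} G_n(x + \theta^*_{ij} - \theta^*_{ir})\, dG_n(x).$$

Then

$$\sup_{\rho \in \Gamma} \gamma_n(\rho) = cn + c' \sum_{i=1}^{k} \pi_i \sum_{j=1}^{k} (\theta^*_{ii} - \theta^*_{ij})\, p^*_{ij}.$$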


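For $\theta \in \Omega_i$, $1 \le i \le k$, let

$$R^{(i)}(\theta_1, \ldots, \theta_k) = \sum_{j=1}^{k} (\theta_i - \theta_j)\, p_j$$

and $g_{ij} = \theta_i - \theta_j$. Then, since
$\theta_j - \theta_r = g_{ir} - g_{ij}$,

$$R^{(i)}(g_{i1}, g_{i2}, \ldots, g_{ik}) = \sum_{j=1}^{k} g_{ij}
  \int_{-\infty}^{\infty} \prod_{r=1,\; r \ne j}^{k} G_n(x + g_{ir} - g_{ij})\, dG_n(x).$$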

Note that all the $g_{ij}$ ($j \ne i$) are positive, by definition, and
$g_{ii} = 0$.

We first need to determine the values of the parameters in $\Omega_i$ for
which $R^{(i)}$ is a maximum. Somerville [5] shows that this is achieved
when the $g_{ij}$ ($j \ne i$) are positive and equal, while $g_{ii} = 0$.
Denote the common value of the positive $g_{ij}$ by $g_i$; then

$$R^{(i)} = \sum_{j=1,\; j \ne i}^{k} g_i\, p_j = g_i (1 - p_i).$$

If we denote by $R^{(i)}_{k-1}$ the maximum, with respect to $g_i$, of the
function $R^{(i)}$, then the maximum risk is given by

$$\sup_{\rho \in \Gamma} \gamma_n(\rho) = cn + c' \sum_{i=1}^{k} \pi_i R^{(i)}_{k-1}.$$

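Since

$$R^{(i)} = g_i \Big\{1 - \int_{-\infty}^{\infty} G_n^{k-1}(x + g_i)\, dG_n(x)\Big\},$$

we know that

$$\sup_{g_1 \ge \Delta} R^{(1)} = \cdots = \sup_{g_k \ge \Delta} R^{(k)} = \sup_{g \ge \Delta} R = R_{k-1},$$

where $R = g\{1 - \int_{-\infty}^{\infty} G_n^{k-1}(x + g)\, dG_n(x)\}$. Thus,

$$\sup_{\rho \in \Gamma} \gamma_n(\rho) = cn + c' \Big(\sum_{i=1}^{k} \pi_i\Big) R_{k-1}
  = cn + c'(1 - \pi_0) R_{k-1}.$$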

For the problem of normal distributions $N(\theta_i, 1)$,
$i = 1, 2, \ldots, k$, where we are interested in the selection of the
means $\theta_i$, the function $R$ becomes

$$R = g\big[1 - \Phi_{k-1,1/2}\big(g\sqrt{n/2}, \ldots, g\sqrt{n/2}\big)\big],$$

where $\Phi_{k-1,1/2}$ is the $(k-1)$-variate normal distribution function
with all correlation coefficients equal to $1/2$. If we denote by $M_{k-1}$
the maximum, with respect to $y$ ($\ge \Delta$), of the function

$$y\,[1 - \Phi_{k-1,1/2}(y, \ldots, y)],$$

then the maximum risk is given by

$$\sup_{\rho \in \Gamma} \gamma_n(\rho) = cn + \sqrt{2/n}\; c'(1 - \pi_0)\, M_{k-1}.$$

This can be minimized with respect to $n$ by taking

$$n = \Big\langle \Big(\frac{(1-\pi_0)^2\, c'^2\, M_{k-1}^2}{2c^2}\Big)^{1/3} \Big\rangle,$$

where $\langle x \rangle$ is the smallest integer greater than or equal to
$x$. The values $M_{k-1}$ have been discussed by Dunnett [1]. The values of
$\Phi_{k-1,1/2}(y, \ldots, y)$ are also tabulated in Gupta [2] for
$k = 2(1)13$.
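The normal-means calculation can be sketched numerically as follows. The sketch assumes unit variances and uses the standard one-dimensional representation $\Phi_{k-1,1/2}(y, \ldots, y) = \int \Phi^{k-1}(x + \sqrt{2}\,y)\,\varphi(x)\,dx$ for equicorrelated normals with correlation $1/2$. The function names, the search interval, and the lower bound `y_min` are assumptions of this example. The sample-size rule is the stationary point of $cn + \sqrt{2/n}\,c'(1-\pi_0)M_{k-1}$ rounded up, treating $M_{k-1}$ as not depending on $n$.

```python
import math

def std_normal_pdf(x):
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def std_normal_cdf(x):
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def equicorr_cdf(y, k, steps=400, lim=8.0):
    """Phi_{k-1,1/2}(y,...,y) via Simpson's rule on the one-dimensional
    integral of Phi(x + sqrt(2) y)^(k-1) against the N(0,1) density."""
    h = 2.0 * lim / steps
    total = 0.0
    for m in range(steps + 1):
        x = -lim + m * h
        weight = 1 if m in (0, steps) else (4 if m % 2 else 2)
        total += weight * std_normal_cdf(x + math.sqrt(2.0) * y) ** (k - 1) * std_normal_pdf(x)
    return total * h / 3.0

def golden_section_max(f, a, b, tol=1e-6):
    """Maximize a unimodal function f on [a, b]."""
    r = (math.sqrt(5.0) - 1.0) / 2.0
    x1, x2 = b - r * (b - a), a + r * (b - a)
    f1, f2 = f(x1), f(x2)
    while b - a > tol:
        if f1 < f2:
            a, x1, f1 = x1, x2, f2
            x2 = a + r * (b - a)
            f2 = f(x2)
        else:
            b, x2, f2 = x2, x1, f1
            x1 = b - r * (b - a)
            f1 = f(x1)
    return 0.5 * (a + b)

def max_M(k, y_min=0.0, y_max=6.0):
    """M_{k-1}: maximum over y in [y_min, y_max] of y * [1 - Phi_{k-1,1/2}(y,...,y)]."""
    f = lambda y: y * (1.0 - equicorr_cdf(y, k))
    return f(golden_section_max(f, y_min, y_max))

def minimax_sample_size(c, c_prime, pi0, M):
    """Smallest integer >= ((1-pi0)^2 c'^2 M^2 / (2 c^2))^(1/3)."""
    return math.ceil(((1.0 - pi0) ** 2 * c_prime ** 2 * M ** 2 / (2.0 * c ** 2)) ** (1.0 / 3.0))

M1 = max_M(k=2)
# Hypothetical cost constants, chosen only to exercise the formula.
n_star = minimax_sample_size(c=1.0, c_prime=1000.0, pi0=0.0, M=M1)
```

For $k = 2$ the function reduces to $y[1 - \Phi(y)]$, whose maximum is about 0.17. The cost constants used for `n_star` are hypothetical.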


By using Somerville's table [5] for the values of $M_{k-1}$,
we compute some values of $n$ as follows:


Table  I 

Minimum  Sample  Sizes  for  the  Normal  Means  Problem 


2 

2 

7 

4 

7 

w 

2 

3 

4 

| l 

2 

2 

3 

Note that we choose $\Delta$
3.  Scale Parameters

We assume that the $k$ independent populations have continuous
distribution functions $F(x/\theta_1), F(x/\theta_2), \ldots, F(x/\theta_k)$,
respectively, where the scale parameter $\theta_i$ is positive and unknown
and $x > 0$. Let $X_{i1}, \ldots, X_{in}$ denote $n$ independent
observations from the $i$th population, and define
$\bar X_i = \frac{1}{n}\sum_{j=1}^{n} X_{ij}$, $1 \le i \le k$.

As before, we know that for any $i$, $1 \le i \le k$,

$$P(\bar X_i \le x) = H_n\Big(\frac{x}{\theta_i} \,\Big|\, F\Big).$$

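Let $\delta(x)$ denote the same procedure as before. Then the probability
$q_i$ that the $i$th population is selected is given by

$$q_i = P\Big\{\max_{1 \le j \le k,\; j \ne i} \bar X_j < \bar X_i\Big\}
      = \int_{0}^{\infty} \prod_{j=1,\; j \ne i}^{k} H_n\Big(x\,\frac{\theta_i}{\theta_j} \,\Big|\, F\Big)\, dH_n(x \mid F).$$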

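Let $\Omega_i = \{\theta \mid \theta_i \ge \delta \max_{1 \le j \le k,\; j \ne i} \theta_j\}$,
$i = 1, 2, \ldots, k$, where $\delta$ ($> 1$) is a given constant.

For $\theta \in \Omega_i$, $1 \le i \le k$, define
$L^{(r)}(\theta, \delta_j) = 0$ for all $r \ne i$, and

$$L^{(i)}(\theta, \delta_j) = c'\,\delta_j \log\frac{\theta_i}{\theta_j}, \quad j = 1, 2, \ldots, k,$$

where $L^{(i)}(\theta, \delta_j)$ represents the loss for
$\theta \in \Omega_i$ when the $j$th population is selected, $c'$ being a
positive constant. For $\theta \in \Omega_0$, the loss is zero.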

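By a discussion similar to that given before, for any $i$,
$1 \le i \le k$, let $\theta_i^*$ be some point in $\Omega_i$ such that

$$\sup_{\theta \in \Omega_i} \sum_{j=1}^{k} q_j \log\frac{\theta_i}{\theta_j}
  = \sum_{j=1}^{k} q^*_{ij} \log\frac{\theta^*_{ii}}{\theta^*_{ij}},$$

where $\theta_i^* = (\theta^*_{i1}, \theta^*_{i2}, \ldots, \theta^*_{ik})$ and

$$q^*_{ij} = \int_{0}^{\infty} \prod_{r=1,\; r \ne j}^{k} H_n\Big(x\,\frac{\theta^*_{ij}}{\theta^*_{ir}}\Big)\, dH_n(x).$$

Then

$$\sup_{\rho \in \Gamma} \gamma_n(\rho) = cn + c' \sum_{i=1}^{k} \pi_i \sum_{j=1}^{k} q^*_{ij} \log\frac{\theta^*_{ii}}{\theta^*_{ij}},$$

where $\Gamma$ is defined as before.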


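Let

$$Q^{(i)}(\theta_1, \ldots, \theta_k) = \sum_{j=1}^{k} q_j \log\frac{\theta_i}{\theta_j}$$

and $h_{ij} = \theta_i/\theta_j$. Then, since
$\theta_j/\theta_r = h_{ir}/h_{ij}$,

$$Q^{(i)}(h_{i1}, h_{i2}, \ldots, h_{ik}) = \sum_{j=1}^{k} (\log h_{ij})
  \int_{0}^{\infty} \prod_{r=1,\; r \ne j}^{k} H_n\Big(x\,\frac{h_{ir}}{h_{ij}}\Big)\, dH_n(x).$$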

Note that all the $h_{ij}$ are positive, by definition, and $h_{ii} = 1$.
By a discussion similar to that in Section 2, we have

$$\sup_{\rho \in \Gamma} \gamma_n(\rho) = cn + c'(1 - \pi_0)\, Q_{k-1},$$

where

$$Q_{k-1} = \sup Q^{(1)} = \cdots = \sup Q^{(k)} = \sup_{h \ge \delta} Q
  = \sup_{h \ge \delta}\, (\log h)\Big[1 - \int_{0}^{\infty} H_n^{k-1}(xh)\, dH_n(x)\Big].$$
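The quantity $(\log h)[1 - \int_0^\infty H_n^{k-1}(xh)\,dH_n(x)]$ can be estimated by simulation when no closed form for $H_n$ is convenient. The sketch below does this for gamma populations with shape $a$, using the fact that a sum of $n$ independent gamma variables with shape $a$ and a common scale is gamma with shape $na$. The configuration $\theta_i = h$, $\theta_j = 1$ ($j \ne i$), the function names, and the parameter values are assumptions of this example.

```python
import math
import random

def estimate_q_best(h, k, n, a, trials=20000, seed=1):
    """Monte Carlo estimate of q_i when theta_i / theta_j = h for all j != i.

    Comparing sample means is the same as comparing sample sums, and the
    sum of n gamma(a) observations with scale theta is gamma(n*a) with
    scale theta, so each population needs a single draw per trial.
    """
    rng = random.Random(seed)
    nu = n * a
    wins = 0
    for _ in range(trials):
        best = h * rng.gammavariate(nu, 1.0)                      # population i
        others = max(rng.gammavariate(nu, 1.0) for _ in range(k - 1))
        if best > others:
            wins += 1
    return wins / trials

def Q_estimate(h, k, n, a, **kwargs):
    """Estimate of (log h) * (1 - q_i) in the equal-ratio configuration."""
    return math.log(h) * (1.0 - estimate_q_best(h, k, n, a, **kwargs))

q_equal = estimate_q_best(1.0, k=3, n=5, a=1.0)   # all scales equal
q_apart = estimate_q_best(2.0, k=3, n=5, a=1.0)   # best scale twice the others
```

With all scale parameters equal, each population is selected with probability $1/k$, which gives a simple check on the simulation. Scanning `Q_estimate` over a grid of $h \ge \delta$ gives a rough numerical approximation to $Q_{k-1}$.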

For the selection of the scale parameters of gamma populations with
densities

$$\frac{1}{\theta_i \Gamma(a)} \Big(\frac{x}{\theta_i}\Big)^{a-1} \exp\Big(-\frac{x}{\theta_i}\Big), \quad x > 0,$$

where $\theta_i$ ($> 0$), $1 \le i \le k$, are the unknown scale parameters
and $a$ ($> 0$) is a known shape parameter, let

$$Q_n = cn + c'(1 - \pi_0)(\log h)\Big[1 - \int_{0}^{\infty} G_\nu^{k-1}(hx)\, dG_\nu(x)\Big],$$

where $G_\nu(x)$ is the cdf of a standardized gamma random variable with
$\nu = na$.

Let $N_{k-1}$ be the maximum, with respect to $h$ ($\ge \delta$), of the
function

$$\Big[1 - \int_{0}^{\infty} G_\nu^{k-1}(hx)\, dG_\nu(x)\Big] \log h;$$

then the maximum risk is given by

$$\sup_{\rho \in \Gamma} \gamma_n(\rho) = cn + c'(1 - \pi_0)\, N_{k-1}.$$

It is not analytically feasible to minimize this maximum risk with respect
to $n$ by differentiation. We can use the same method as Ofosu [4] to make
a numerical study. Some asymptotic $\Gamma$-minimax solutions are discussed
as follows.


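It is known that as $\nu$ tends to $\infty$,

$$\big(\nu - \tfrac{1}{2}\big)^{1/2} \log\{\bar X_i/(a\theta_i)\}, \quad (i = 1, 2, \ldots, k)$$

is asymptotically distributed as $N(0,1)$. Hence, as $\nu \to \infty$, we
have

$$\sup_{\rho \in \Gamma} \gamma_n(\rho) = cn + c'(1 - \pi_0)\, 2(2\nu - 1)^{-1/2} M_{k-1},$$

where $M_{k-1} = \sup_h\, h\{1 - \Phi_{k-1,1/2}(h, \ldots, h)\}$, the supremum
being taken over the admissible values of $h$. Then
$\sup_{\rho \in \Gamma} \gamma_n(\rho)$ can be minimized with respect to
$n$, for large $\nu$, by taking

$$n = \Big\langle \frac{1}{2a}\Big\{\Big[\frac{2ac'}{c}(1 - \pi_0) M_{k-1}\Big]^{2/3} + 1\Big\}\Big\rangle,$$

where $\langle x \rangle$ is the smallest integer greater than or equal to $x$.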
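The asymptotic rule can be checked against a direct search over integer sample sizes. The sketch below assumes the asymptotic maximum risk has the form $cn + c'(1-\pi_0)\,2(2na - 1)^{-1/2} M_{k-1}$ with $M_{k-1}$ treated as a constant, and takes the closed form as its stationary point rounded up. The function names and the numerical constants (including the illustrative value 0.17 for $M_{k-1}$, roughly the $k = 2$ maximum of $y[1 - \Phi(y)]$) are assumptions of this example.

```python
import math

def asymptotic_max_risk(n, a, c, c_prime, pi0, M):
    """cn + c'(1 - pi0) * 2 * (2*nu - 1)^(-1/2) * M with nu = n * a."""
    nu = n * a
    return c * n + c_prime * (1.0 - pi0) * 2.0 * M / math.sqrt(2.0 * nu - 1.0)

def asymptotic_sample_size(a, c, c_prime, pi0, M):
    """Smallest integer >= (1/(2a)) * { [2 a c' (1 - pi0) M / c]^(2/3) + 1 }."""
    inner = (2.0 * a * c_prime * (1.0 - pi0) * M / c) ** (2.0 / 3.0)
    return math.ceil((inner + 1.0) / (2.0 * a))

# Hypothetical constants, used only to exercise the formulas.
params = dict(a=1.0, c=1.0, c_prime=1000.0, pi0=0.0, M=0.17)

n_rule = asymptotic_sample_size(**params)
n_brute = min(range(1, 501), key=lambda n: asymptotic_max_risk(n, **params))
```

The rounding-up rule and the brute-force integer minimizer agree for these constants. In general the integer minimizer is either the ceiling or the floor of the continuous stationary point, so a direct comparison of the two neighbouring integers is a cheap safeguard.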

Applications to the selection of scale parameters of Weibull populations
and to normal variances problems can be obtained in the same way as in
Ofosu [4].

